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Abstract: This paper aims to provide insight into stability of collaboration choices in P2P networks. We study net- 
works where exchanges between nodes are driven by the desire to receive the best service available. This is the case 
for most existing P2P networks. We explore an evolution model derived from stable roommates theory that accounts for 
heterogeneity between nodes. We show that most P2P applications can be modeled using stable matching theory. This 
is the case whenever preference lists can be deduced from the exchange policy. In many cases, the preferences lists are 
characterized by an interesting acyclic property. We show that P2P networks with acyclic preferences possess a unique 
stable state with good convergence properties. 
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Applications de la theorie des affectations a 1' etude des reseaux P2P 

Resume : Cet article vise a decrire la possible stabilite d'un reseau pair-a-pair en termes de collaborations. Nous etudions 
les reseaux ou les echanges entre pairs sont bases sur la volonte d'obtenir le meilleur service possible (hypothese valable 
pour une grande partie des reseaux P2P existants). Nous considerons un modele d'evolution base sur la theorie des stable 
roommates afin de prendre en compte l'heterogeneite des pairs. Nous montrons comment etudier la plupart des reseaux 
P2P a Faide de ce modele. Dans la plupart des cas rencontres, le systeme de preferences sous-jacentpossede unepropriete 
d'acyclicite. Pour de tels systemes, nous montrons 1' existence d'une unique solution stable avec de bonnes proprietes de 
convergence. 

Mots-cles : pair-a-pair, mariages stables, proprietes de convergence 
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1 Introduction 

During the last few years, peer-to-peer (P2P) applications have emerged and now appear as a major component of the 
Internet, from both traffic and content distribution points of view. Peer-to-peer network performances scale reasonably 
with the number of users. This definitely makes P2P a leading paradigm for tomorrow's networking applications. One 
of the most striking achievements is certainly BitTorrent protocol [Coh03| for content distribution. It is based on a Tit- 
for-Tat mechanism, which is used to regularly compute collaboration links between the peers. Despite the simplicity of 
this approach, measurements and analysis are hard to provide due to the massively distributed nature of the applications. 
Starting from this statement, we present a formal method to analyze a large class of networks (including BitTorrent-like 
applications) with regard to the stability of collaborations. 

More precisely, our main contribution is a model that fits any peer-to-peer protocol where peers are allowed to choose 
partners they are collaborating with. We just suppose that each peer ranks other peers according to some preference func- 
tion. For example, in a P2P file-sharing network, each peer can rank other peers according to the similarity of their interest. 
In a cooperative download application such as BitTorrent [Coh03|, the upload bandwidth appears as a major parameter 
as the incentive mechanism consists in selecting collaborators based on how much they upload. Additional parameters 
like download bandwidth, latency, storage capacity or even manual choices can also be used to define the preferences. 
Even though exact mechanisms of P2P solutions can be more complicated, our modeling gives a first approximation. In 
particular, it explains the evolution of a system where collaborators are selected according to such a parameter. 

This work aims to deduce properties of the connection graph induced by the preference system chosen by a P2P 
application. The paper is structured as follows. In the next section, we present existing formal theories, namely the stable 
marriage problem [GS62| and the stable roommate problem |CF05|. In Section[3]we apply these theories to develop a 
model that includes many P2P applications. Section[4]identifies three main preference classes that appear in existing P2P 
applications. All of them appear to be cycle free and can be seen as particular cases of what we call acyclic preferences. 
In Section[3]we focus on acyclic preference systems and present a corresponding stability result. The convergence speed 
is discussed in Section [6] 

2 Background: matching theory 

Stable marriage problems were introduced by Gale and Shapley in 1962 [GS62|. An instance of the Stable Marriage 
problem involves two sets of participants, conveniently called the set of men M and the set of women W . A common 
assumption is that every member of each gender has strict preferences over the members of the opposite gender 1 . The 
purpose of the theory is to find and describe the stable matchings (or configurations) between M and W. A matching M. 
is said to be unstable if there is a pair (m,w) where each one prefers being matched with the other rather than being in its 
current situation in M. This pair is said to block the matching M., and is called a blocking pair for M.. A stable matching 
is a matching with no blocking pair. Using a concept of proposals, Gale and Shapley have shown that all instances of the 
marriage problem possess at least one stable state that can be reached in O(hf) proposals, where h is the number of men 
and / the number of women. 

If there is only one set of participants (called peers), where anybody can be matched with anybody, then we get a 
different problem, called the Roommate problem. This change has two important consequences. Firstly, the existence of 
a stable configuration is no longer guaranteed, and secondly, a proposals algorithm like the Gale-Shapley algorithm may 
not converge, even if a stable configuration exists. 

The first issue has been addressed by Irving in IHrv851 . The Irving algorithm finds a stable configuration to the 
roommates problem, if there is one, or it indicates that no solution exists. However, to the best of our knowledge, an 
adaptation of the Gale-Shapley algorithm to the roommates problem is still to be developed. One of the contributions of 
this work is a natural extension of Gale-Shapley to the stable roommates theory for the special case of acyclic preferences 
(see Section|5]l. 

Finally, the roommates problem can be further generalized by allowing any peer p to establish a number b(p) of 
simultaneous partnerships (instead of a single one as in the classical roommate matching problem). This generalization 
is often called many-to-many matching in the bipartite case or ^-matching in the general case[Fle03 , KU03 1. Cechlarova 
and Fleiner [CF05| show how a fo-matching problem can be transformed into an equivalent 1 -matching problem. We 
propose to use ^-matching to model the connections in a P2P network, as detailed in the following section. 

'The Stable Marriage with Ties problem (SMT) raises issues that will not be addressed in this paper. For existing studies on SMT, see [IMMM99 
IMan021 . 
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3 Networks as matching instances 

P2P networks are formed by establishing an overlay network between peers. Any peer acts both as a server and a client. 
Each peer p uses a bounded number of connections. As the network evolves, peers continuously seek after new (or better) 
partners. Each protocol implements its own algorithm for this searching phase. But in most of the cases, its output can 
be seen as a preference list over the contacts. Thus, a P2P protocol algorithm for connecting peers can be modeled as an 
instance of a ^-matching problem. 

As for the stable roommates problem, we consider a set P of n peers. All possible connections between the peers are 
defined by an acceptance graph. Each peer p has a quota b(p) on the number of mates (connections). In addition, all 
neighbors of p from the acceptance graph are sorted according to a given preference system and form a preference list 
denoted by L(p). L(p, q) denotes the position (rank) of peer q in p's list. In other terms, L(p) is a permutation of all 
neighbors of p. If L(p, q) < L(p, r) then we say that p prefers q to r. The best rank corresponds to 1. The degree of 
a peer is the length of its preference list. If we denote by L a vector of preference lists corresponding to all peers, then 
(P, L) (if there is no ambiguity P is omitted) defines an instance of the Roommates Problem. 

For simplicity we consider only undirected acceptance graphs: p S L(q) iff q G L(p). There is no loss of generality 
since pairs are formed only between peers that mutually accept each other. Let m denote the number of edges of the 
acceptance graph. 

When a partnership is established between two peers p and q, we say that each one is a mate of the other, or equivalently 
that the pair {p, q] is formed. A configuration C is defined as a set of formed pairs {p, q} such that each pair p has at 
most b(p) mates. Some peers may be single (i.e. not paired). The set of all configurations is called C. It contains the 
trivial configuration Cg, where no peer is paired. 

In a configuration C, we say thatp is under-mated if it has less than b(p) mates in C. A blocking pair for a configu- 
ration C is a pair {p, q] ^ C such that each member of the blocking pair is either under-mated or prefers the other to its 
worst mate in C. 

We propose to model the evolution of the system through initiatives, a natural extension of the Gale-Shapley initiative 
algorithm 1GS621 . An initiative is the process by which a peer may change its mates. Given a configuration C, we say 
that peer p takes the initiative when it proposes to other peers to be its new mate. Basically, p may propose partnership to 
any acceptable peer. However, a new partnership is only interesting when a blocking pair exists. If a peer p is part of a 
blocking pair (p, q) and elopes with q, the initiative is then called active because it modifies the configuration (both peers 
will change their set of mates). 

To find such a new mate, p searches its preference list avoiding peers that do not improve its situation. We identify 
several strategies depending on how p searches its preference list: 

• best mate: p seeks the best peer with which it forms a blocking pair. 

• decremental mate: p circularly scans L(p) starting from the position of the previous initiative. 

• random mate: p chooses at random among the blocking pairs it belongs to. 

Best mate seems to be the best strategy from a peer's point of view. However, when making proposals takes a valuable 
time, it may not be realistic. For this reason, we consider the two other types of initiative which are more suited to model 
simultaneous asynchronous initiatives. For instance, consider an application where peers are not aware of their neighbors' 
value. To try to find a better mate, a peer will select a random neighbor, probe it and keep collaborating with it if it is 
more interesting than some previous mate. This is the random mate strategy. 

Let us now consider how preferences usually appear in P2P networks. 

4 Acyclic networks 

In this section, we show that most current P2P networks tend to conclude partnerships based on preferences which appear 
to be acyclic. A preference cycle between k > 3 peers pi-..pk occurs if pi prefers pi+\ to pi-\ (modulo k). A preference 
instance is acyclic if it contains no preference cycle. 

First consider networks where mates are selected according to some inherent capacity like available bandwidth 2 , 
computing capacity, or storage capacity. In such a system, a peer p possesses an intrinsic mark m(p) acknowledged by 
all the peers it knows. Peers with higher marks are preferred. The preference lists resulting from such a policy are called 
global preferences. Consider a preference chain pi-..pk where peerp^ prefers pi + i to Pi-i. As marks increase along the 
chain, it cannot form a preference cycle, and global preferences are always acyclic. 

As an example, consider the "Tit-for-Tat" strategy of BitTorrent [Coh03 |. Each peer prefers to exchange with peers 
with the best upload capacity: as such peers provide data at a higher rate, they appear as best uploaders as soon as a 

2 Network available bandwidth often mainly depends on the type of the peer Internet connection and how much is consumed by other concurrent 
applications. 
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steady sequence of chunk exchanges is initiated. BitTorrent's Tit-for-Tat policy is thus close to a global preference system 
according to upload capacity. In addition to best uploaders, each peer also serves a random contact. This "generous" 
connection can be seen as a probing mechanism for finding mates with better upload capacity. 

However, it should be noted that peer selection also relies on the complementarity of file chunks. This can be further 
modeled by a second type of preference system that we call complementary preferences. In such a system, all peers try 
to get the same set of resources (such as file chunks in cooperative file download). Each peer then prefers to exchange 
resources with peers possessing the largest number of its missing resources. It can be shown that such complementary 
preferences are also acyclic. Notice that this kind of preference changes as blocks are downloaded. However, the peers 
with the largest complementary set of blocks are those enabling the longest block exchange sessions. In its most general 
form, the selection of peers for cooperative file download can be seen as a mix of two acyclic preference systems. 

Finally, we identify a third class of acyclic preference systems where each peer p gives a mark m(p, q) to each peer q 
it knows in such a manner that marks are symmetric: m(p, q) — m(q, p) for all p, q. Each peer prefers to pair with peers 
with the best marks. Such a preference system is said to be symmetric. Again the marks increase along a preference chain, 
preventing the existence of a preference cycle. The simplest example of such a preference system comes from latency 
optimization. Consider an overlay scheme such as Pastry [RD01 1 that is optimized by selecting contacts with the smallest 
round trip time (RTT) in the physical network. As the RTT is a symmetrical measure (on average), it results in symmetric 
marks. 

In fact, any selection mechanism induced by proximity according to some distance function results in such sym- 
metric preferences. For example, massively multiplayer online games (MMOG) require to connect players with nearby 
coordinates in a virtual space. This can be modeled by symmetric preferences based on distance in the virtual space. 
Similarly, some authors propose to connect participants of a file sharing system according to the similarity of their inter- 
est [FHKM04 SMZ03 |. Any such preference system based on proximity is symmetric and thus acyclic. 

We have seen that many P2P networks are formed through peer selection algorithms that can be modeled by preference 
instances that are acyclic. We now consider the stability properties of such preference systems. 



5 Stability result 

While it is difficult to find the stable solutions for general preferences in the roommates problem, the issue for acyclic 
preferences is much simpler, as shown by Theorem[T] 

Theorem 1 An acyclic b-matching preference instance always has a unique stable configuration. 

Proof: We first prove by contradiction that there can be at most one stable solution when preferences are acyclic. Suppose 
A and B are two distinct stable configurations of the instance. There exists a peer p\ with different mates in A and B. 
Let p2 be the best mate among the mates p\ is matched with in A or B, but not in both configurations. Assume, without 
loss of generality, that p\ is mated with p2 in A, but not in B. As B is stable, p2 has 6(^2) mates in B it prefers to p\ 
(otherwise {pi,P2} would be a blocking pair for B). At least one of them is not its mate in A; let p$ be the best ranked. 
For a similar reason, p^ has a mate p^ in A and not in B it prefers to p2- We iterate the process to construct a sequence 
of peers (pi)i>i. As the set of peers is finite, a peer pk is found that is already present in the sequence. Let us take the 
smallest k such that pi = pt for some i < k. The choice of Pi+\ implies that pi prefers Pi+i to pt-i- By construction, 
the circular list Pi,ps,..., Pk-i is a preference cycle. This contradicts the fact that the instance is acyclic. 

To ensure the existence of a stable solution, we now prove that when preferences are acyclic, a sequence of active 
initiatives (i.e., initiatives that change the configuration) never goes twice through the same configuration. As there is a 
finite number of possible configurations, if we keep altering the configuration through initiatives, we eventually reach a 
configuration that cannot be altered with any initiative: a stable configuration. 

The proof is simple. If a sequence of initiatives induces a cycle of at least two distinct configurations, then one can 
extract a preference cycle: let p\ be a peer whose mates change through the cycle. Call P2 the best peer p\ is unstably 
paired with during the cycle, and p% the best peer P2 is unstably paired with during the cycle. p\ is not p^ and P2 prefers 
P3 to pi, otherwise the pair {pi,pz} would not break during the cycle. Iterating the process, we build a sequence of peer 
(pk) such that pk prefers Pk+i to Pk-i, until we find i < j such that pi = pj. The circular list (pi,Pi+i, ■ ■ ■ ,Pj-i) is a 
preference cycle. □ 

Theorem[T]can be also proved using Tan's decomposition theory |Tan91|. However, our proof has the advantage of 
leading to CorollaryQ] 

Corollary 1 Any sufficiently long sequence of active initiatives leads to the unique stable configuration. 
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Any initiative algorithm, starting from any configuration, computes the stable configuration if we assume it gives 
enough initiatives to active peers. This statement alone gives no guarantee on the convergence speed (except a factorial 
bound derived from the number of possible configurations), but more insight is given next. 

6 Convergence results 

In order to understand acyclic dynamics, we shall introduce the concept of loving pair. A loving pair {p, q] is a pair 
of peers such that peer p is ranked first by peer q and vice versa. Loving pairs are the key to understanding acyclic 
preferences and convergence to the stable state. They have two main properties: first, loving pairs are unbreakable. Once 
peers of a loving pair are matched together, no sequence of initiatives can unmatch them. The second property is given by 
Theorem[2] An instance is said to be trivial when all preference lists are empty. 

Theorem 2 Any non-trivial acyclic preference instance always has at least one loving pair. 

Proof: Consider a non-trivial preference instance. There exist 2 peers po, p\ such that L(po,pi) = 1. If L(pi,po) = 1, 
then {po,pi} is a loving pair. Otherwise, there exists p^ ^ po such that L{pi 1 p2) — 1. If we continue this process, we 
eventually find a loving pair pi, Pi+\ such that L(pi,pi + i) = L(pi + i,pi) = 1. If this is not the case, we construct a 
sequence po, . . . ,pi, . . . such that L(pi^i,pi) — L(pi,pi + i) = 1, with pi-\ ^ Pi+i- As the number of peers is finite, the 
sequence loops, producing a preference cycle. □ 
Because of loving pairs, the stable solution can be constructed pair by pair through initiatives. This is stated in 
Theorem[3] 

Theorem 3 For any acyclic preferences instance, starting from any initial configuration C, there exists a sequence of at 
most initiatives leading to the stable solution, where B — ^2 peP b(p). 

Proof: Theorem Q] guarantees that a stable configuration exists for any acyclic preference instance. We show that this 
stable state can be reached by matching loving pairs. We say that a pair is stable when no sequence of initiatives can 
break it. All stable pairs {p, q} can be taken out of consideration: we can virtually remove them from the acceptance 
graph, erasing each peer from the preference list of the other, and decrementing the quotas b(p) and b(q). Similarly, a peer 
p that appears in b(p) stable pairs will never change its mates, which is equivalent to considering a preference instance 
where p has been removed. In both cases, we obtain a smaller, but strictly equivalent preference instance. Moreover, this 
preference instance remains acyclic. 

Starting from any configuration, we first remove all stable pairs and peers with a full quota of stable pairs as described 
above. The equivalent preference instance is acyclic. As long as it is non-trivial, Theorem [2] implies the existence of 
loving pairs. We give a best mate initiative to one peer of a loving pair. The loving pair is then formed and it is stable. We 
can remove it from consideration. It results in an equivalent preference instance where B is decremented by 2. If one of 
the peers has now a quota, it can be removed from the preference instance. By iterating this process at most % times, 
we end up in a configuration with a trivial equivalent preference instance where all preference lists are empty. This final 
configuration is thus the stable solution of the initial preference instance. □ 

The above algorithm for computing an optimal initiative sequence is hard to implement in a massively distributed 
environment. However the key of Theorem|3]is that the stable solution is made of at most B/2 stable matchings, and that 
at each moment, one of them is a loving pair for the current configuration. With the best mate strategy, a loving pair is 
formed as soon as one of its peers has the initiative. We can estimate the convergence speed by estimating the time needed 
to match loving pairs. For instance, consider two simple random algorithms: periodic and Poisson. 

In the periodic algorithm, each peer takes a best mate initiative every t seconds. Due to network latencies, we consider 
that the order of initiative may be different in each period. By Theorem|2] with best mate, a loving pair is formed after at 
most t seconds. Thus, in an acyclic preference instance, with periodic initiatives, the stable configuration is reached after 
at most yt seconds. 

In the Poisson algorithm, at any step a peer drawn uniformly at random with probability — takes an initiative. A 
classical balls and bins result states that, with high probability (w.h.p.), each peer will have taken the initiative at least once 
after n log n drawings. Thus, in an acyclic preference instance, with best mate Poisson initiatives, the stable configuration 
is reached in 0(nB log n) initiatives w.h.p.. 

The mean convergence time is much smaller. Considering that in each unstable configuration there exists at least 2 
peers from a loving pair, the mean time between the creation of two stable pairs in a best mate Poisson initiatives sequence 
is at most ^. As a consequence, for any acyclic instance and any starting configuration, the mean time to reach the stable 
state with best mate Poisson initiatives is at most 
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7 Future Work and Conclusions 

In the present work we have given upper bound estimates of the convergence time. These results are based on the existence 
of at least one loving pair at every step. Preliminary simulation results lead us to believe that the ^ bound on is tight 
for global preferences, when b — 1 and the acceptance graph is complete. In this case the only loving pair is composed 
of the two best globally ranked peers, which do not yet have paired together. However, we suspect that several other P2P 
networks preference systems, such as symmetric preferences, may contain a large number of loving pairs at a time. These 
systems should converge much faster. We plan to identify and analyze such preference systems. 

Note, that our convergence results assume that the preference lists are static. However, for most P2P networks, the set 
of peers and their preference lists evolve in time. Further work will consider the impact of such dynamics on the stable 
configuration of the system. A major interest is to compare convergence speed to the system evolution speed. As long as 
it is fast enough, we can expect that the system will smoothly follow the evolving stable configuration target. 

We have shown how collaboration selection algorithms can be modeled using the matching theory. In fact, most of 
these algorithms lead to acyclic preference instances and we have proved that such preference systems always evolve 
towards a stable configuration. Additional insight was given on the convergence time which is required to reach this 
stable configuration. 

Acknowledgment: The authors wish to thank James Roberts for his helpful comments 
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